Activation functions in deep learning: A comprehensive survey and benchmark

نویسندگان

چکیده

Neural networks have shown tremendous growth in recent years to solve numerous problems. Various types of neural been introduced deal with different However, the main goal any network is transform non-linearly separable input data into more linearly abstract features using a hierarchy layers. These layers are combinations linear and nonlinear functions. The most popular common non-linearity activation functions (AFs), such as Logistic Sigmoid, Tanh, ReLU, ELU, Swish Mish. In this paper, comprehensive overview survey presented for AFs deep learning. Different classes Sigmoid Tanh based, ReLU ELU Learning based covered. Several characteristics output range, monotonicity, smoothness also pointed out. A performance comparison performed among 18 state-of-the-art on data. insights benefit researchers doing further research practitioners select choices. code used experimental released at: https://github.com/shivram1987/ActivationFunctions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a contrastive study of rhetorical functions of citation in iranian and international elt scopus journals

writing an academic article requires the researchers to provide support for their works by learning how to cite the works of others. various studies regarding the analysis of citation in m.a theses have been done, while little work has been done on comparison of citations among elt scopus journal articles, and so the dearth of research in this area demands for further investigation into citatio...

Nonparametrically Learning Activation Functions in Deep Neural Nets

We provide a principled framework for nonparametrically learning activation functions in deep neural networks. Currently, state-of-the-art deep networks treat choice of activation function as a hyper-parameter before training. By allowing activation functions to be estimated as part of the training procedure, we expand the class of functions that each node in the network can learn. We also prov...

متن کامل

Online Learning: A Comprehensive Survey

Online learning represents an important family of machine learning algorithms, in which a learner attempts to resolve an online prediction (or any type of decision-making) task by learning a model/hypothesis from a sequence of data instances one at a time. The goal of online learning is to ensure that the online learner would make a sequence of accurate predictions (or correct decisions) given ...

متن کامل

assessment of deep word knowledge in elementary and advanced iranian efl learners: a comparison of selective and productive wat tasks

testing plays a vital role in any language teaching program. it allows teachers and stakeholders, including program administrators, parents, admissions officers and prospective employers to be assured that the learners are progressing according to an accepted standard (douglas, 2010). the problems currently facing language testers have both practical and theoretical implications but the first i...

A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2022

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2022.06.111